Online Scheduling Technique To Handle Data Velocity Changes in Stream Workflows
نویسندگان
چکیده
Many IoT applications and services such as smart parking traffic control contain a network of different analytical components, which are composed in the form workflow to make better decisions. These workflows also known stream workflows. The focus existing research works is on streaming operator graph, differs from application it involves heterogeneity, multiple data sources outputs. Considering complexity dynamism workflow, meeting real-time analysis requirements at deployment time not whole story velocity changes over time. This change most dynamic that occurs frequently during execution this application. In article, we propose new scheduling technique manages cloud resources handle while maintaining user-defined minimising cost. efficiency proposed evaluated, experimental results showed outperformed its competitors close lower bound.
منابع مشابه
Algorithm to handle Concept Drifting in Data Stream Mining
Data Stream Mining is the evolving field of research. Mining continuous data streams brings unique opportunities but also new challenges. This paper will describe and evaluate the proposed classifier which uses ensemble classifier along with the boosting concept. Adaptive windowing is also used for handling the data stream. Empirical study will show that the proposed classifier takes less memor...
متن کاملOnline Input Data Reduction in Scientific Workflows
Many scientific workflows are data-intensive and need be iteratively executed for large input sets of data elements. Reducing input data is a powerful way to reduce overall execution time in such workflows. When this is accomplished online (i.e., without requiring users to stop execution to reduce the data and resume execution), it can save much time and user interactions can integrate within w...
متن کاملOnline approach to handle concept drifting data streams using diversity
Concept drift is the trend observed in almost all real time applications. Many online and offline algorithms were developed in the past to analyze this drift and train our algorithms. Different levels of diversity are required before and after a drift to get the best generalization accuracy. In our paper, we present a new online approach Extended Dynamic Weighted Majority with diversity (EDWM) ...
متن کاملA Multiversion Trajectory Data Warehouse to Handle Structure Changes
The data warehouse (DW) technology was developed to integrate heterogeneous information sources for analysis purposes. Information sources are more and more autonomous and they often change their content due to perpetual transactions (data changes) and may change their structure due to continual users' requirements evolving (schema changes). Handling properly all type of changes is a must. In f...
متن کاملOperator Scheduling in a Data Stream Manager
Many stream-based applications have sophisticated data processing requirements and real-time performance expectations that need to be met under asynchronous, time-varying data streams. In order to address these challenges, we propose novel operator scheduling approaches that specify (1) which operators to schedule (2) in which order to schedule the operators, and (3) how many tuples to process ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Parallel and Distributed Systems
سال: 2021
ISSN: ['1045-9219', '1558-2183', '2161-9883']
DOI: https://doi.org/10.1109/tpds.2021.3059480